AITopics | current situation

Collaborating Authors

current situation

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

6b8dfb8c0c12e6fafc6c256cb08a5ca7-Paper-Conference.pdf

Neural Information Processing SystemsOct-8-2025, 20:44:47 GMT

large language model, machine learning, natural language, (22 more...)

Neural Information Processing Systems

Country:

North America > United States > California > Los Angeles County > Los Angeles (0.14)
Asia > Vietnam > Hanoi > Hanoi (0.04)
Asia > China > Beijing > Beijing (0.04)
(2 more...)

Industry:

Materials > Metals & Mining (0.96)
Leisure & Entertainment > Games (0.72)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Planning & Scheduling (0.93)
(2 more...)

Add feedback

Validating Generative Agent-Based Models of Social Norm Enforcement: From Replication to Novel Predictions

Cross, Logan, Haber, Nick, Yamins, Daniel L. K.

arXiv.org Artificial IntelligenceJul-30-2025

As large language models (LLMs) advance, there is growing interest in using them to simulate human social behavior through generative agent-based modeling (GABM). However, validating these models remains a key challenge. We present a systematic two-stage validation approach using social dilemma paradigms from psychological literature, first identifying the cognitive components necessary for LLM agents to reproduce known human behaviors in mixed-motive settings from two landmark papers, then using the validated architecture to simulate novel conditions. Our model comparison of different cognitive architectures shows that both persona-based individual differences and theory of mind capabilities are essential for replicating third-party punishment (TPP) as a costly signal of trustworthiness. For the second study on public goods games, this architecture is able to replicate an increase in cooperation from the spread of reputational information through gossip. However, an additional strategic component is necessary to replicate the additional boost in cooperation rates in the condition that allows both ostracism and gossip. We then test novel predictions for each paper with our validated generative agents. We find that TPP rates significantly drop in settings where punishment is anonymous, yet a substantial amount of TPP persists, suggesting that both reputational and intrinsic moral motivations play a role in this behavior. For the second paper, we introduce a novel intervention and see that open discussion periods before rounds of the public goods game further increase contributions, allowing groups to develop social norms for cooperation. This work provides a framework for validating generative agent models while demonstrating their potential to generate novel and testable insights into human social behavior.

architecture, artificial intelligence, natural language, (17 more...)

arXiv.org Artificial Intelligence

2507.22049

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)

Industry: Social Sector (0.35)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Cognitive Science (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents > Agent Societies (0.68)

Add feedback

Feedback-Induced Performance Decline in LLM-Based Decision-Making

Yang, Xiao, Leitner, Juxi, Burke, Michael

arXiv.org Artificial IntelligenceJul-22-2025

The ability of Large Language Models (LLMs) to extract context from natural language problem descriptions naturally raises questions about their suitability in autonomous decision-making settings. This paper studies the behaviour of these models within a Markov Decision Process (MDPs). While traditional reinforcement learning (RL) strategies commonly employed in this setting rely on iterative exploration, LLMs, pre-trained on diverse datasets, offer the capability to leverage prior knowledge for faster adaptation. We investigate online structured prompting strategies in sequential decision making tasks, comparing the zero-shot performance of LLM-based approaches to that of classical RL methods. Our findings reveal that although LLMs demonstrate improved initial performance in simpler environments, they struggle with planning and reasoning in complex scenarios without fine-tuning or additional guidance. Our results show that feedback mechanisms, intended to improve decision-making, often introduce confusion, leading to diminished performance in intricate environments. These insights underscore the need for further exploration into hybrid strategies, fine-tuning, and advanced memory integration to enhance LLM-based decision-making capabilities.

large language model, machine learning, natural language, (18 more...)

arXiv.org Artificial Intelligence

2507.14906

Genre: Research Report > New Finding (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.30)

Add feedback

What to Do Next? Memorizing skills from Egocentric Instructional Video

Bi, Jing, Xu, Chenliang

arXiv.org Artificial IntelligenceJul-8-2025

Learning to perform activities through demonstration requires extracting meaningful information about the environment from observations. In this research, we investigate the challenge of planning high-level goal-oriented actions in a simulation setting from an egocentric perspective. W e present a novel task, interactive action planning, and propose an approach that combines topological affordance memory with transformer architecture. The process of memorizing the environment's structure through extracting af-fordances facilitates selecting appropriate actions based on the context. Moreover, the memory model allows us to detect action deviations while accomplishing specific objectives. T o assess the method's versatility, we evaluate it in a realistic interactive simulation environment. Our experimental results demonstrate that the proposed approach learns meaningful representations, resulting in improved performance and robust when action deviations occur .

information, machine learning, natural language, (14 more...)

arXiv.org Artificial Intelligence

2507.02997

Genre: Research Report > New Finding (0.66)

Industry: Health & Medicine (0.46)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Vision (0.96)
(5 more...)

Add feedback

Me, Myself, and AI: The Situational Awareness Dataset (SAD) for LLMs

Laine, Rudolf, Chughtai, Bilal, Betley, Jan, Hariharan, Kaivalya, Scheurer, Jeremy, Balesni, Mikita, Hobbhahn, Marius, Meinke, Alexander, Evans, Owain

arXiv.org Artificial IntelligenceJul-5-2024

AI assistants such as ChatGPT are trained to respond to users by saying, "I am a large language model". This raises questions. Do such models know that they are LLMs and reliably act on this knowledge? Are they aware of their current circumstances, such as being deployed to the public? We refer to a model's knowledge of itself and its circumstances as situational awareness. To quantify situational awareness in LLMs, we introduce a range of behavioral tests, based on question answering and instruction following. These tests form the $\textbf{Situational Awareness Dataset (SAD)}$, a benchmark comprising 7 task categories and over 13,000 questions. The benchmark tests numerous abilities, including the capacity of LLMs to (i) recognize their own generated text, (ii) predict their own behavior, (iii) determine whether a prompt is from internal evaluation or real-world deployment, and (iv) follow instructions that depend on self-knowledge. We evaluate 16 LLMs on SAD, including both base (pretrained) and chat models. While all models perform better than chance, even the highest-scoring model (Claude 3 Opus) is far from a human baseline on certain tasks. We also observe that performance on SAD is only partially predicted by metrics of general knowledge (e.g. MMLU). Chat models, which are finetuned to serve as AI assistants, outperform their corresponding base models on SAD but not on general knowledge tasks. The purpose of SAD is to facilitate scientific understanding of situational awareness in LLMs by breaking it down into quantitative abilities. Situational awareness is important because it enhances a model's capacity for autonomous planning and action. While this has potential benefits for automation, it also introduces novel risks related to AI safety and control. Code and latest results available at https://situational-awareness-dataset.org .

introspection, monologue task, plain prompt situating prompt 0, (13 more...)

arXiv.org Artificial Intelligence

2407.04694

Country:

North America > United States > New York (0.04)
Europe > France (0.04)
Asia > Azerbaijan (0.04)
(9 more...)

Genre: Research Report > New Finding (0.45)

Industry:

Government > Military (1.00)
Leisure & Entertainment > Sports > Olympic Games (0.45)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Forewarn: Business growth with current situation of AI in Construction Market - DataScienceCentral.com

#artificialintelligenceSep-9-2022, 15:20:35 GMT

Today, AI in construction industry has become a common tool for carrying out many construction activities. In addition, many big companies in the construction industry all across the globe are immensely adopting AI as it boasts a multitude of applications. AI has the ability to accurately evaluate the cost overrun of a project, on the basis of factors such as type of contract, size, and also the level of competence of the managers to risk moderation via self-driving machinery and equipment.

artificial intelligence, construction industry, intelligence, (13 more...)

#artificialintelligence

Country: Asia > India > Maharashtra > Pune (0.05)

Industry:

Construction & Engineering (1.00)
Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (0.61)
Health & Medicine > Therapeutic Area > Immunology (0.61)

Technology: Information Technology > Artificial Intelligence (1.00)

Add feedback

Hariri

AAAI ConferencesFeb-8-2022, 12:42:46 GMT

Recommender systems have become essential tools in many application areas as they help alleviate information overload by tailoring their recommendations to users' personal preferences. Users' interests in items, however, may change over time depending on their current situation. Without considering the current circumstances of a user, recommendations may match the general preferences of the user, but they may have small utility for the user in his/her current situation.We focus on designing systems that interact with the user over a number of iterations and at each step receive feedback from the user in the form of a reward or utility value for the recommended items. The goal of the system is to maximize the sum of obtained utilities over each interaction session. We use a multi-armed bandit strategy to model this online learning problem and we propose techniques for detecting changes in user preferences. The recommendations are then generated based on the most recent preferences of a user. Our evaluation results indicate that our method can improve the existing bandit algorithms by considering the sudden variations in the user's feedback behavior.

current situation, hariri, recommendation

AAAI Conferences

Industry: Education (0.64)

Technology:

Information Technology > Data Science > Data Mining > Big Data (0.64)
Information Technology > Artificial Intelligence > Representation & Reasoning > Personal Assistant Systems (0.64)

Add feedback

Pinaki Laskar on LinkedIn: #AGI #Sensors #AI

#artificialintelligenceAug-11-2021, 05:40:31 GMT

AI Researcher, Cognitive Technologist Inventor - AI Thinking, Think Chain Innovator - AIOT, XAI, Autonomous Cars, IIOT Founder Fisheyebox Spatial Computing Savant, Transformative Leader, Industry X.0 Practitioner Understanding #AGI architecture, includes effectors, which are understood as sensors and actuators. Effectors are a kind of ambivalent element, being controlled by the AGI system and at the same time being part of the natural or virtual embodiment. Both sensors and actuators imply "smart" devices/units that carry a two-way exchange of information with the AGI system itself. AGI sends commands to the effector and receives data in response. From the AGI point of view, the difference between sensors and actuators is that the purpose of sensors is to collect information about the current situation, and actuators are to change the situation.

effector, information, sensor and actuator, (5 more...)

#artificialintelligence

Technology:

Information Technology > Artificial Intelligence (1.00)
Information Technology > Communications > Social Media (0.85)

Add feedback

Forewarn: Business growth with current situation of AI in Construction Market

#artificialintelligenceJun-26-2021, 22:50:52 GMT

artificial intelligence, construction industry, intelligence, (12 more...)

#artificialintelligence

Country: Asia > India > Maharashtra > Pune (0.05)

Industry:

Construction & Engineering (1.00)
Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (0.61)
Health & Medicine > Therapeutic Area > Immunology (0.61)

Technology: Information Technology > Artificial Intelligence (1.00)

Add feedback

Clustering U.S counties by their COVID-19 curves

#artificialintelligenceNov-29-2020, 05:20:37 GMT

Analytics has become a part of everyone's daily routine as a result of the pandemic. Every day we look at curves of new cases, positivity rates, and a range of other metrics that give us insight into our current situation. One interesting metric used by the CDC, along with many news networks and publications is hotspot classification. A hotspot is a county or state where cases are currently increasing at a relatively high rate [1]. This metric is simple and easy to understand. But it leaves out a lot of interesting details.

algorithm, covid-19 case, spike, (16 more...)

#artificialintelligence

Country: North America > United States > Washington > Clark County (0.05)

Industry:

Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (0.43)
Health & Medicine > Therapeutic Area > Immunology (0.43)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback